Finetuning Qwen2.5-3B using DPO with Unsloth

Finetuning Qwen2.5-3B with DPO using Unsloth on a TinyStories preference dataset

Finetuning
DPO
Unsloth
Qwen
Author

Quang T. Duong

Published

August 24, 2024

Get started with GenAI & LLMs through my Udemy course, Hands-on Generative AI Engineering with Large Language Model 👇

🤝 What is

Example of configuration: